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Abstract 

We consider parametric hypotheses testing for multidimensional ergodic diffusion processes ob- 
served at discrete time. We propose a family of test statistics, related to the so called (^i-divergence 
measures. By taking into account the quasi-likelihood approach developed for studying the stochastic 
differential equations, it is proved that the tests in this family are all asymptotically distribution free. In 
other words, our test statistics weakly converge to the chi squared distribution. Furthermore, our test 
statistic is compared with the quasi likelihood ratio test. In the case of contiguous alternatives, it is also 
possible to study in detail the power function of the tests. 

Although all the tests in this family are asymptotically equivalent, we show by Monte Carlo analysis 
that, in the small sample case, the performance of the test strictly depends on the choice of the function 
(j>. Furthermore, in this framework, the simulations show that there are not uniformly most powerful 
tests. 

Keywords: discrete observations, distribution free tests, generalized likelihood ratio tests, parametric 
hypotheses testing, quasi-likehhood functions, stochastic differential equations 

1 Introduction 

In the last years there has been a growing interest around diffusion processes defined by means of stochas- 
tic differential equations. Indeed, diffusion models are useful for describing the random evolution of real 
phenomena studied, for instance, in physics and biology. These random processes have often been ap- 
pUed in mathematical finance and econometric theory to describe the behavior of stock prices, exchange 
rates, interest rates. Although, the underlying random model evolves continuously in time, the time data 
are always recorded at discrete instants (e.g. weekly, daily or each minute). For these reasons the infer- 
ence problems for discretely observed diffusion processes have been tackled by several authors in order 
to provide useful statistical tools for the applied researchers and practitioners. 

Let Xt, t G [0,T], be a rf-dimensional diffusion process solution of the following d-dimensional 
stochastic differential equation dXt ~ b{a, Xt)dt + (j{I3, Xt)dWt, where functions b and a are suitably 
regular and known up to the parameters a g M^* and /3 e R'^. The process Xt is discretely observed at 
times ti, such that ti — = A„ < oo for 1 < i < n. In order to test the parametric vector 6 = {a, /?) 
of the process Xt, t e [0, T], this paper proposes the construction of a family of test statistics for the 
following hypotheses testing problem 



Ho:e = 0o versus Hi : 6 ^ 



'0- 
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The problem of testing parametric hypotheses for diffusion processes is still a developing stream 
of research. In continuos time, Kutoyants (2004) and Dachian and Kutoyants (2008) considered the 
problem for ergodic diffusion models; Kutoyants (1984) considered the same problem for small diffusion 
processes. In discrete time, Lee and Wee (2008) dealt with a parametric version of the score marked 
empirical process test statistics, while Ait-Sahalia (1996), Giet and Lubrano (2008) and Chen et al. (2008) 
proposed tests based on the several distances between parametric and nonparametric estimation of the 
invariant density of ergodic diffusion processes. 

The test statistics introduced in this paper has been inspired by the 0-divergence theory that we recall 
briefly in the following. Let {p{X, 9),9 E Q} be a family of probability densities. Denote by Ee the 
expected value with respect to Pg, the true law of the observations X. Let (p : [0, oo) -> M be a convex and 
continuous function. Furthermore, its restriction on (0, oo) is finite, two times continuously differentiable 
and such that 0(1) = 4''{1) = and (f>"{l) = 1. Then the 0-divergence measure between the two models 
p{X, 6) and p{X, 0o), ^ 9q, is defined as 

Z,,(M.)^E.„*(g^) (LI) 

We remind that i/i-divergences are contain as special cases many divergences like the a-divergences 
(Csiszar, 1967) the Kullback-Leibler and the Hellinger divergences the Renyi's divergence, the power- 
divergences studied in Cressie and Read (1984). 

The 0-divergence measures have been used for testing hypotheses in parametric models. The reader 
can consult on this point, for example Morales et al. (1997) and Pardo (2006). Morales et al. (2004) 
applied modified Renyi's divergence for testing problems on the family of exponential models. 

Given a sample of n independent and identically distributed (i.i.d.) observations and some asymptoti- 
cally efficient estimator 9n, to test Hq : 9 = Oq against Hi : 9 9o the 0-divergence test statistic is given 
by Dcf,{9„ , 9o). For a one-dimensional diffusion process with /3 = /3* assumed known, the 0-divergence 
is formally given by 

Dc^{9,9o)= J c^(^^^dPe, (1.2) 

where 

dPe _ \ r b{a,Xt)-b{ao,Xt) ^^ ^ 1 b^a, Xt) ^ b^jap, Xt) 

d^^^'^piX — ^WJ^) — ^""^'sX — ^{FJQ — ''t 

The study of ^-divergences for continuous time observations of diffusion processes has been considered 
in Vajda (1990). Explicit derivations of the Renyi information on the invariant law of ergodic diffusion 
processes have been presented in De Gregorio and lacus (2009). Kiichler and S0rensen (1997) provide 
several results on the likelihood ratio test statistics statistics for exponential families of diffusion pro- 
cesses. For small diffusion processes, Uchida and Yoshida (2004) derived information criteria using 
Malliavin calculus. For discrete time observations, Renyi divergence measures has been considered in 
Rivas et al. (2005). 

Formula ( 11.2b is not useful for testing problems on discretely observed diffusion processes. Therefore, 
we take into account an alternative approach. Let us consider the following statistic 

where pi{-) is a suitable (gaussian) approximation of the transition density of the process Xt from Xt^^-^ 
to Xt- . Notice that the function 2?0.„ is not a true 0-divergence, nor a proper approximation of it, but its 
behaviour is studied in Section 4. 
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Let 9n be any consistent estimator of 9, then the family of test statistics is defined as follows „(6'„, 9o) := 
2nD0 „(6'„, 6'o). By exploiting the quasi-likelihood approach developed by Genon-Catalot and Jacod 
(1993), Florens-Zmirou (1993), Kessler (1997) and Yoshida (1992, 201 1), we derive the asymptotic dis- 
tribution of the tests under the null hypothesis and under the case of contiguous alternatives. 

The paper is organized as follows. In Section|2]we introduce the notations, the model, the regularity 
conditions and the asymptotic framework. Section |3] contains preliminary results needed in Section |4] 
where the family of test statistics are introduced and studied. Section |5]is devoted to numerical analysis 
of the performance of the tests for small sample sizes. The methodology developed in this paper can be 
applied to other diffusion models, Section|6]discuss this point. The Appendix contains some auxiliary but 
useful results for the proofs of this presented in this work. The tables are collected at the end of the paper 

2 Notation and basic assumptions 

Let Xt, t E [0, T], be a d-dimensional diffusion process solution of the following multivariate stochastic 
differential equation 

dXt = b{a,Xt)dt + a{P,Xt)dWt, Xa = xo, (2.1) 

where a = (ai, ap)' e Qp C W, p > 1, /S ^ e Qq (ZW, q > 1, are p x 1 and g x 1 

vectors respectively, 6 : Op x M'' ^ R'^, cr : Gg x R'^ R'' x R"' and {Wt, < t < T}, is a standard 
Brownian motion in R"'. We assume that the functions b and a are known up to the parameters a and /3. 

The sample path of Xt is observed only at n + 1 equidistant discrete times tj, such that tj — ti^i = 
A„ < oo for 1 < i < n (with <o = and t„ = T). The asymptotic scheme adopted in this paper is the 
following; T = 7iA„ — >■ oo, A„ — > and nA^ — as n — > oo. The previous scheme is called rapidly 
increasing design, where the number of observations grows over the time but no so fast. 

The following notations will be used throughout the rest of the paper: 

• We denote hy 9 = {a, /3) G 8p x 9<j = 9 the {p + q)xl parametric vector and with Oq = (ao , /?o) 
its unknown true value. The parameter space 9 is a compact set of RP+''. 

• X„ = {Xt-}o<i<n represents our random sample with values in Mt'^+i)^*^. 

• For a matrix A, we denote by A^^ the inverse of A and by \ A\'^ = tT{AA'), i.e. the sum of squares 
of the elements on the diagonal of matrix A. 

• Wesetl](/3,x) = cr(/3, a;)cr(^, x)', S(/3, x) = I]-i(/3, a;) andX,(a) ^ Xt^-Xt,_,~Anb{a, Xt,). 

. For/i = ifiu...,fim), d^, ^f^, := 9^^^, ^ (9^, , a^,J' and 

= [d'^^^,]k,k'=i,...,m denotes the Hessian matrix. 

• If / : 9 X M'' ^ R, we denote by f,{e) the value f{9,Xt,)- for example = E(^,XtJ. 
Furthermore, if / is a tensor, we indicate with the upper index its components; when / is a matrix 
ji,m represents its (/, m)-component. 

• For < i < n, = iA„, = cr(Ws,s < U). 

• Let Un be a R-valued sequence. We indicate by i? a function 9 x R'' — >^ R for which there exists a 
constant C such that 

R{9, Un, x) < M„C(1 + 1x1)'^, 9e 9, X eW^^ne N. 
We need some assumptions on the regularity of the process Xt,t £ [0,T]: 
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Ai ■ there exists a constant C such that 

\b{ao,x) - b{ao,y)\ + \a{/3o,x) - cr(/3o,y)| < C\x - y\; 
A2. inf^,:,det(S(;3,a;)) > 0; 

A3 . the process Xt,t & [0, T] , is ergodic for 9 — 9o with invariant probabiUty measure ne ■ Thus 

as T — > 00, where / is a /.jg-integrable function. 

A4. if the coefficients b{a,x) ~ b{ao,x) and a{l5,x) — (T(/3o,a;) for all x (/ie^-almost surely), then 

a = ao and /? = /3o; 

A- for all m > and for all g e 6, sup^ E\Xt\'^ < 00; 

Ae- for every 9 G Q, the coefficients 6(0;, x) and <j{/3, x) are five times continuously differentiable with 
respect to x and the derivatives are bounded by a polynomial function in x, uniformly in 9; 

At- the coefficients b{a, x) and cr(/3, x) and all their partial derivatives with respect to x up to order 
2 are three times continuously differentiable with respect to 9 for all x in the state space. All 
derivatives with respect to 9 are bounded by a polynomial function in x, uniformly in 9. 

We observe that the assumption Ai ensures the existence and uniqueness of a solution to ( 12.11 ) for the 
value 6*0 = (q!o,/?o) of 6* e 9, while Ai is the identifiability condition. Hereafter, we assume that the 
conditions Ai — A7 hold. These conditions are equivalent to the ones in Uchida and Yoshida (2005) and 
Kessler (1997) for what concerns the regularity of the model. 

3 Preliminary results 

Since the transition density between Xt-_-^ and Xt- is almost always unknown, for the estimation of 
stochastic differential equations it has been developed an alternative tool with respect to the likelihood 
function. Therefore, in order to introduce the test statistics for the stochastic differential equation ( 12.11 ). 
we consider a quasi-likelihood approach based on a suitable contrast function. In other words, we deal 
with the negative quasi-loglikelihood function H„ : m("+^)^'' x 9 — )- M 

H„(X„,0) := ^H,(0) := 2 E logdct(E,_i(/3)) + —X[{a)E,^^i^3)X^{a) ■ (3.1) 
i=i i=i " ' 

The function ( 13. Il l is obtained by discretization of the continuous time stochastic differential equation 
(12.11) by Euler-Maruyama scheme, that is 

Xt,-Xt^_,^ r 6(a,X,)ds+ /' a{p,Xs)dWs 

^ bia, J A„ + a(/3, Xt^_,){Wu - Wu_,) 

and the increments {Xt- — Xt-_-^) are (approximately) conditionally independent Gaussian random vari- 
ables for i = 1, n. 
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The quasi-likelihood ( 13. Il l has been used by, e.g., Florens-Zmirou (1993), Yoshida (1992, 2011), 
Genon-Catalot and Jacod (1993) and Kessler (1997) to make inference for stochastic differential equa- 
tions. This last author considered a more general asymptotic scheme, that is nAf^ 0,p > 2, and 
generalizes the contrast function ( 13. Il l improving the convergence results. For the sake of simplicity we 
focus our attention to the case nA^ — > 0. 

Let 0n : M("+i)><'^ ^ 6 be the quasi-maximum likeUhood estimator of 9 E Q, based on ( 13. Il l, that is 

On = (dn,/3n) = arg min (X„ , 6*) . 

G 



and let us consider the matrix 

ip{n) 



^i. 



where Ip and Ig are respectively the identity matrix of order p and q. The regularity conditions Ai — A7 
imply some fundamental results which have a crucial role for analyzing the asymptotic distributional 
behavior of the estimators (and thus of our test statistics as we will show in the next Section). Indeed, 
as shown in Kessler (1997) and Yoshida (2011), 6'„ is a consistent estimator of Oq and asymptotically 
Gaussian with rate of convergence given by ip{n)^^/^, i.e. 

v{n)-^/Hk - Oo) 4 N{0,I{9o)-^). (3.2) 
where T{0o) is the positive definite and invertible Fisher information matrix at given by 

T(f)n] - ( [^b'''(^0)]j,fc=l,...,p 

^ " I [I-'^-(^o)],,.=i...... 

where 



The matrix (p{n) plays the role of the rate of convergence in the estimation problem for the stochastic 
differential equation (12.1b . 

The Bayes type estimator 6'„ = (a„, /3„) for 9 is defined by 

/3„ = I y" cxp (H„(X„, («*,/?)) )^i(d/3) I X y"/3cxp(M„(X„,(a*,/3)))^i(d/?) 

cxp (^EI„(X„, (a, 7r2(dQ;)| x y aexp (^H„(X„, (a, /3„))^ 7r2(da) 

where a* is an arbitrary constant and 7ri(/3) and 7r2(a) are the prior distributions of /? and a respec- 
tively. Yoshida (201 1) proved that 6'„ is a consistent estimator for 9o and that the weak convergence ( 13.2b 
still holds for this estimator Other classes of estimators with similar asymptotic properties exist in the 
literature, for a recent review see S0rensen (2004). 

The next result on the score function will be useful in the study of the asymptotic behavior of the test 
statistics. 

Lemma 3.1. Let An{9) be a [p + q) x (p + q) matrix with {j, k)- component given by 
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A^'^^W = ^9e,H,(0)ae,H,(0), = 1,2,..., p + q. 

2=1 

Under the conditions Ai — Aj, the following property holds true 

Proof. For the sake of simplicity we denote by Xi := Xi{a), := 6i_i(a), := 
order to prove (13.31) . we deal with the following expressions 



(3.3) 
In 

(3.4) 



/,m— 1 



2 1 A, 



det(I],_i) 



fc = 1, ...,(?, 



and show that 



nA 



2 — 1 
1 



by means of Lemma lT4l 

Let us start proving the result (I3.6l l. Bearing in mind Lemma r7!2ll7.3l we get that 



1 " 



nAn 



n d 



z— 1 /,m— 1 I'.m' — l 
^ n d d 1 

iEE E 5t"i5^'T'sr-'r'5..feLia«.Ci + ^E^(^o,A„,x._i) 



i—l /,m— 1 /',m' — 1 
l.m—l I' ,m' — l 



and 



1 " 

-^E^«o{(5a,H,(0o)a„,H,(0o))'|5ILl} 



n2A2 

" i=l 



(3.5) 

(3.6) 
(3.7) 
(3.8) 



(3.9) 
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i—l I \l ,m—l I' ,m' — 1 

^ n d d 

^ n d d r 2 / 
^ \ \ \ /"/.m .m' o ,/ f) h^' \ { x^'iT^-fn-^rn' ,m' , r) / >pm,r7 



■i—l /,m— 1 I' ,m' — l 



^'0 



•i?(0o,A',^,-i) 
(3.10) 



The proof of the result ( 13.71 ) is developed as follows 

1 " 



1=1 



/,77l — 1 



E ^^^r^ 



./ , gft. det(S,_i) 
det(I],;_i) 



r .m' — 1 



det(S., 



i-1 



■i—l k /,m— 1 V .m' — l 

+ det(E,_i) E ^^--1^-1+ det(I],_i) E ^^^--1 ^-1 +i?(^o,A„,X,_i) 



/,m— 1 

9ff^.det(£,_i)a^^,det(S],_i) 
det(S]^_i) dct(E,_i) 



I' .ni' — l 



^ n d d ^ n 

^E E E 5,,st™9,,I]flfs:^ls:i7' + i-^(i?(0o,A^,AVi) 



i=l l.m—1 l\m' — 1 i—l 

where in the last step we have used the following relationship 

dp, det(£,_i) 
dct(E,_i) 

and, once again, Lemma lT2ll7.3l 
In order to prove that 



-tr[a^,S,_iS]i_i] = tr[a/3,Si-iSi_i] 



1 

-Y^EeA{d,3M(^,)dp,m.,{e,)f\gU} ^' o 



we use the same arguments in ( 13.10b and further note that 



(3.11) 
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which is a consequence of Lemma fTTTl By using the previous arguments, we get that 

1 " 

j= ^ Ee,{dc.M.{Oo)dMOo)\Gr.,} 



nJA, 



1 

2r^^/A7 



il.m—l 



I' .m' — l 



det(I],_i) 



i—l \ l^nid' ;m' — l 



/,m— 1 
1 ^ 

= ^E[^(^0'\/^'^'-i) + ^(^0'^n^''^^-i)] 



and 



This last step concludes the proof. 



— Y,EeA{daM,{Oo)dMeo)r\g:-^} 



□ 



4 The family of test statistics 

Let us remind that : [0, 00) ^ M is convex and continuous. Furthermore, its restriction on (0, 00) 
is finite, two times continuously differentiable. The goal of this section is to construct a family of test 
statistics for the following hypotheses testing problem 

Ha: 9 = 9a, vs Hi : 9 =^ 9a 

concerning the stochastic differential equation (12.1b . To this aim, let us consider the following quantity 



1 " 

n 5Z ' 



MO) 

P^{.Oa) 



(4.1) 



where Pi{9) := exp(Hi(0)) and M.i{9) is defined as in ( 13.1b . The statistic 2?0.„(0, 0o) represents the 



empirical mean of the functions 1 



which measure the discrepancy between two (approximated) 



parametric models given the sample X„. The statistics 2?0.„(0, ^o) is not an approximation of the 
divergence (II. lb because it does not converge to 



but it proves to be useful in the construction of a new class of asymptotically distribution free test statistics 
as we will see in what follows. The next result establishes the convergence in probability of 'D^^niO, 9o). 
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Theorem 4.1. Under the conditions Ai — At, we have that 



Pen 



(4.2) 



uniformly in 9, where 

C/0(/3,/3o) 



/ / dct(S](/3,x)) 
'^Udet(S(/?o,.T)) 



/ /dct(E(/3,a-)) 



Vdet(I](/3o,x)) 

Meo (da:) 



(4.3) 



Proof. Let us define 



= * (( dt'g::;(W) )'°''{2Z; (T:(o)H.-(fflX,,o) -X:K)E._,(A)X.(a.) 
By setting x{a) — [x ~ Xi_i — A„6i_i(a)), we observe tliat 



Vdct(S,_i(/3o)) 



2A„ 



/, 771 — 1 



;,m=i 



and 

^„ I /det(S,_i(/3)) 



Vdet(I],_i(/3o)) 
/det(I],_i(/?)) 



exp ' 



^ ^ (x'(a)S:L"U/3)x"(a)-5f'(ao)S^^"i(/3o)a:"(«o)) 



2A„ 



/ m— 1 



7^)^^P E (s'(a)S:-:i(^)x™(a)-5j'Mst"l(/?o)x-(ao) 



/,m— 1 



Vdet(I], 



^, / /det(E,_i(/?)) 



^ ^ (;r'(a)5l^l(/3)x"(a)-x'(ao)S[L"l(/3o)x™(ao) 



/,m— 1 



/ det(E,_i(/3)) 
Uct(S.-i(/3o)) 



cxp ■ 



^ ^ (x'(a)S:L"l(/3)x™(a)-x'(ao)S:L"(/3o)x"K 



Lm— 1 



m— 1 

, / /det(Ei_i(/3)) 



Vclet(E,_i(/3o)) 



exp 



2A, 



/.m— 1 



/ det(E,-i(/3)) 
Vdet(Ei_i(/?o)) 



cxp ' 



Lm— 1 



x-^(S^(/3)-S:e\(/3o)) 

By taking into account ( 17. Il l, one has that 

Seo{i^(XO|gr-i} 



fc = l k.k' = l 

^ [iS§nm) ' 1^ ^ (bLi(a)Ht"i(/3)6r-iW - 6Li(ao)Ht"l(/3o)C-i(«o)) | j 

-A„g6t,(a„)|<^' ( d't{Kl^{2)) ) " I ^ (&ti(«)HL™i(/3)C-i(a)-&Li(ao)Ht"l(/3o)6r-i(ao)) 

" I I II ] 



/ det(S.-i(/3)) 
Uet(E,_i(/3o)) 



exp • 



An 



1,171 — 1 



/ det(Si-i(/3)) 
l^det(Ei_i(/3o)) 



E (H,^l7(/3)&"i(a) -Ht7(/3o)C-i(ao)) 



+ 



i ,m — 1 
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^ (Hf-(/3)6r-i(a) -Ht7(/3o)&r-i(ao)) 



i ,m — 1 



Therefore, the following result holds 



^Es,„{F(xoier-i} 
1=1 



lQ)-6Li("o)Ht"l(/3o)6r-i(«o) 
U det(E~f(A)T) ^ '^^P^~y^ E (&Li(a)Hl:!:i(/J)&ri(a) -feLi(ao)Hi'ri(^o)&,™-i(ao)) 



/ / det(E,_i(/3)) 



/ det(E,_i(/3)) 



^ det(S^'-7(A0) J ®^P|~2^ E (&'-i(a)2l:ri(^)&ri(a) - &Li(ao)Hi'"i(^o)&ri(ao)) 

/ — ' n ^ — ' 71 ^ — ' 

fe.fc'=i J i=i i=i 



f/0(/?,/3o) 



(4.4) 



uniformly in 6, where in the last step we have used Lemma lTJl 
Now, by means of the same arguments it is not hard to prove that 



1 



(4.5) 



In conclusion Lemma IT4l and the results ( 14.41 ). ( 14.51 ) implies the statement of the present Theorem. 



□ 



We point out that for 0(.t) = log(x), from Theorem l4.1l we derive that 

, / det(I](/3, a;)) 



tr(S(/?,.T)I](/3o,a;))-rf + log 



Vdet(I](/3o,x)) 



which coincides with the result (4.1) in Kessler (1997). 

Remark further that I?0,„(6', 6*0) can be used as a contrast function to derive minimum contrast esti- 
mators 9n which solve l?0,ri(^ri, ^0) = whose properties can be studied using Theorem l4.1l 

Hereafter we assume, as before, that (j) : [0,oo) — M is convex and continuous, its restriction on 
(0, 00) is finite, it is two times continuously differentiable and, in addition, that (f>{l) = (p'{l) = and 
0"(1) - 1. 
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Now introduce the family of test statistics defined as follows 

The above quantity is similar to that used by Morales e al. (1997), where D^^„ {S, Ovi) is replaced by the 
true (/)-divergence. 

Let On be the quasi maximum likelihood of the Bayes-type estimator defined in Section [3] The first 
step is to prove that the family of test statistics 

T^,„(4,0o) (4.6) 

is asymptotically distribution free under iJo- 

Theorem 4.2. Under Hq and the conditions Al — A7, as n ^ oo, we have that 

T^,ni^n,0o) ^ Xl+r (4.7) 
Proof. By Taylor's formula, we have that 

= nl?^,„(0o, ^o) + ndeV^^nido, do){0n - do) + ^{On - eoYnd^gV^^niOo, OoM, - 9o) + op(|e"„ - 9o\^) 

where in the last step we have used the fact that 0(1) — (/>' {1) = 0. We note that the (j, fc)-element of the 
Hessian matrix dgV^^niS, Oq) is given by 

and then 

n 

d9l^fn((^o,0o) = - Vae,H,(0o)5e,H,(0o) = -Ai^(?o) 

i=l 

By taking into account Lemma 13.11 and the convergence result ( 13.2b . the statement of the Theorem 
follows immediately. 

□ 

Given the level a, such test rejects if T^.n > where Cq is the 1 — a quantile of the limiting 
random variable Xp+q- The power function of the proposed test is equal to 

P'^iO) = Pe [t^A^u. Oo) > c,} , 9^ Oo. 

This power function can be studied under the contiguous alternative setup. Indeed, in this case we are able 
to approximate f3^^ (9) by means of a distribution function of a non-central chi square random variable. 

Theorem 4.3. Under the conditions Al — A7, Ho : = Oo and the alternative contiguous hypotheses 
Hi : = 0o + >fi{n)h, where h e {h e RP+'^ : 9 = 9o + >f{n)h e 9}, we have that 

/3;'(^^) = l-Fp+,(c„), (4.8) 

where Fp+g(-) is the cumulative function of the random variable Xp+gCA*) which is a non-central chi 
square random variable with p + q degrees of freedom and noncentrality parameter fi = h'I{9o)h. 
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Proof. Under Hi : 9 = Oq + Lp{n)2h we have that 

f{n)-i{e,-, - 9o) = f{n)-He,, - 6) + h A N{h,I{9o)-^) (4.9) 

where in the last step we have taken into account the convergence in ( 13.2b and that sup„ \I{9) —I{do)\ = 
op(l) which is implied by the continuity of I{0) in Oq. Therefore, by means of the same arguments 
exploited in the proof of Theorem l4.2l we can write that 

T^,n0n, Oo) - {{On ~ 60)^^-^^^)' ^{n)^/^nd''gV^^^{9o, eoMnf'^^{n)-^'\L - Oo) + op(l) 

^ Xl+q{p) 

This last step concludes the proof. 

□ 

It is well known that the likelihood ratio test is the uniformly most powerful test for all sample sizes 
for testing 6*0 against any simple alternative 9i . In the framework of this paper, the natural benchmark 
test statistics is the generalised quasi-likelihood ratio test (GQLRT), that is 

S,i{6n,do) = 2?i2?iog,„(6'„, 6*0) = 2[E1I„(X„,6'„) — ]HI„(X„, ^?o)] 

(i.e. when 9q is estimated using the maximum likelihood estimator). We observe that Sn{9m&o) is not 
a member of the family of tests ( 14.6b . because it could be obtained from ( 14.6b for (t){x) = log(a;) which 
does not satisfy the requirement of this Section. Nevertheless, the limiting distribution of GQLRTs is the 
same of T^^n{On, Oq) as it is easy to verify, namely 

This implies, as expected, that all test statistics „(6'„, ^q) are asymptotically equivalent to the Sn{0n, do)- 
Next section investigates the behaviour for small sample sizes. 

5 Numerical analysis 

Although all test statistics T^.^ of Section |4] satisfy the same asymptotic results of Theorems 14.71 and 
14.8! for small sample sizes the performance of the test is determined by the statistical model generating 
the data and the sample size. In our numerical study we will consider the power of the test under local 
alternatives as in Theorem 14.81 for different 4> functions, sample sizes of n = 50, 100, 250, 500, 1000 
observations and T ~ na, in order to satisfy the asymptotic theory. For testing Oq against the local 
alternatives 6*0 + for the parameters in the drift coefficient and Oq + for the parameters in the 

diffusion coefficient, h is taken in a grid from to 1, and h = {) corresponds to the null hypothesis ffo- 
We consider the following functions, which are all such that (^{l) = (f)'{l) = and — 1. 

• 4>{x) = 1 — x + x log(x): the true 0-divergence based on this function is equivalent to the Kullback- 
Leibler divergence, but in our setup this is not true. We use the label AKL in the table for this 
approximate KL; 

• (j){x) = xij^\\^~^'' 1 ^ —1,0: this corresponds to the power divergence studied in Cressie 

and Read (1984). We use A = fc in the tables, with k = -30, -10, -3; 

• (j){x) = (f^) • this was proposed in the Balakrishnan and Sanghvi (1968), we name it BS in the 
tables. 
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For the data generating process, we consider the following statistical models 

OU; the one-dimensional Ornstein-Uhlenbeck model solution to dXt = {a~ /3Xt)dt + adWt, Xq = 1, 
with do = {a, /3, a) = (0.5, 0.5, 0.25); 

GBM: the one-dimensional geometric Brownian motion model solution to dXt = {a—f3Xt)dt+aXtdWt, 
Xo = 1, with 6*0 = (a, /3, a) = (0.5, 0.5, 0.25); 

CIR: the one-dimensional CIR model solution to dXt = (a — (3Xt)dt + a^fXl^Wt, Xq ~ 1, with 
6*0 = (a,/3,cr) = (0.5,0.5,0.125); 

MOU: the two-dimensional Ornstein-Uhlenbeck model solution to dX^^ = (2 — [i.\X\^^)dt + a\(Sw'f^ , 
dxf^ ^ (2-/i2Xf^)dt + (T2diyf\Xo = (1,1), with = {fii,fi2,(Ji,<j2) - (1,1,0.3,0.5); 

In all experiments, the process have been simulated at high frequency using the Euler-Maruyama scheme 
and resampled to obtain n — 50, 100, 250, 500, 1000 observations. The T^.n statistics is constructed 
using the quasi maximum likelihood estimator. Each experiment is replicated 1000 times and the Tables 
[T]|2]|3] and|4]contain the empirical power function, i.e. the average values 

a _ #{r«,0(^n,6'o + h/ipn) > Cg} 

" 1000 

and the empirical level of the test 

" ~ 1000 

where Cq, is the estimated a quantile of the empirical distribution of T^.n, with a = 0.05. The choice of 
using the empirical threshold instead of the theoretical threshold from the distribution, is due to 
the fact that otherwise the test are non comparable, i.e the level of test are not a and, for example, when 
/i = the test for different (/)'s produces different empirical level of the test. 

In the Tables [1] |2] [3] and|4]we have used the bold face font to put in evidence the test statistics with 
the highest empirical power function for a given local alternative /i > 0. As mentioned before, the 
natural benchmark test statistics is the generalised quasi likeUhood ratio test GQLRT. 

From the numerical analysis we can see several facts 

• the test statistic based on the AKL <^ function is not equivalent to the GQLRT as in the case of true 
0-divergence test statistics; 

• the GQLRT test statistics appears to be (almost) uniformly more powerful for small h when the 
sample size is small (n — 50) but, as soon as the sample size increases, other divergences test 
statistics „ have higher power; 

• in all cases, the GQLRT is not most powerful test for all alternatives and most of the times, the 
power divergences are more powerful; 

• there seems to be no uniformly most powerful test in among the set of 0-test statistics proposed 
here. 

In conclusion, this paper proposed a new family of test statistics for discretely observed diffusion 
processes. Although closely related to the 0-divergence test statistics, the test proposed here is different 
and new. The empirical analysis shows that, despite the asymptotic equivalence of all test statistics 
proposed here, for small sample size and different statistical models, there is no uniformly most powerful 
test among the members of this family. Furthermore, the generaUsed quasi likelihood test statistics, which 
does not belong to this class, is not necessarily the optimal test as well. 
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6 Extension to other classes of diffusion processes 



The methodology developed in this paper is quite general and under suitable conditions can be applied for 
testing other ergodic diffusion models observed at discrete times. Therefore, in this Section we discuss, 
briefly, some possible extensions of the test statistics ( 14.6b without delving into the technical aspects. 

Let us start considering a generalization of ( 12.11 ) given by an ergodic multidimensional diffusion 
process with jumps, that is 

dXt^b{a,Xt-)dt + a{/3,Xt-)dWt+ [ c{Xt^,z,a)p{dt,dz), Xq = xq, (6.1) 

JE 

where p{dt, dz) is a Poisson random measure on M+ x E', i? = K'' — {0} and g"(dt, dz) is its intensity 
measure, that is E{p{dt, Az)) = q°'{dt, dz). We set g°(dt, dz) = fa{z)dzdt and = A(a)F„(z) 

where A(a) is a nonnegative function and Fa{z) represents a probability density. The parametric esti- 
mation of ( 16.1b has been tackled by Shimizu and Yoshida (2006) and Ogihara and Yoshida (2011) with 
a similar but slightly different contrast function. Shimizu (2006) dealt with M -estimators for the same 
statistical problem. Now, we give some sketches on the contrast function used in the previous papers 
to which we refer for major details about the parametric estimate of the stochastic differential equation 
model ( 16.1b . In particular, Ogihara and Yoshida (201 1) introduced the following quasi-likelihood function 

H„(X„,^) 

■= "2 E |l°gd'^t(^-i(^)) + ^^i(a)S,-i(/3)X,(a)| l{|x,,_;f,__j<^Afj 

i— 1 ^ 

71 

|X(.-Xt__J<DAP} 

i=l 

-A„y" *„(X,_i,y)(p„(X,_i,2/)d2/} (6.2) 

where 13 > 0, p is a suitable constant, ^{y,x) = fg{c~^{x,y,9))\J{x,y,d)\ where J{x,y,9) is the 
Jacobian of c^^{x, y, 9) and < (^Sn (a^, y) < 1 is a sequence of real valued functions satisfying condition 
[HIO] in Ogihara and Yoshida (201 1). 

The introduced contrast function is very natural since it is split in two parts: the first component 
is the contrast for an usual diffusion process, and the second one emerges from the discretization of the 
likelihood function of an compound Poisson process with Levy density . Therefore, \f\Xt-~Xt-_^ \ < 
ZJA^ the function H„(X„, 9) judges no jumps occur in the interval t;] and it reduces to the quasi- 
likelihood function for a diffusion process without jumps; otherwise H„(X„, 9) judges a jumps occurs in 
the previous time interval. 

By taking into account a rapidly increasing scheme, under suitable assumptions, Ogihara and Yoshida 
(2011) proved that the quasi-maximum likelihood estimator and a Bayes type estimator based on ( 16.2b 
are consistent and asymptotically gaussian. Furthermore, it is possible to prove that Lemma [TT] holds. 
This implies that the test statistics T^,n, opportunely modified for testing (16.11) . weakly converges to a 
chi-squared random variable. 

Another random model which attracted the attention of the researchers has been the small-diffusion 
process. In this case we consider the following stochastic differential equation 

dXt^bia,Xt)dt + ea{l3,Xt)dWt, Xo ^ xo, (6.3) 

where t £ [0, 1] and e e (0, 1]. S0rensen and Uchida (2003) and Gloter and S0rensen (2009), introduced 
the contrast function 
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n 

H,^„(X„,0) ^{logdet(S,_i(/3)) +£-2„Y;(a)S,;_i(/3)X,(a)} (6.4) 

i=l 

where Xi{a) = Xt- — Xt^_-^ — i&(Q;,XiJ. In this framework e as n — > oo. Under suitable 
conditions, it is possible to show that 6'j^„ = (de^„, /3e,„) = mine IHIe_„(X„, 6*) is consistent and 



/e ^(ae,„ - ao) 



where I{6) is the Fisher information matrix for ( 16.3b . Furthermore, Lemma [TT] admits a version based 
on the small diffusion process ( I6.3l l. Then the test statistics (I4.6l l constructed by means of (16.4b have the 
asymptotic properties proved in Theorem l4.7H4.8l 



7 Appendix 

We collect in the following some results which assume a crucial role in the proofs of the present work. 
Lemma 7.1. For k> 1 and ti > t > ti^i 

EeA\Xt - Xt._Jler-i} <Ck\t~ t,-i|(l + \Xt^_, 

Proof. It is an application of the Gronwall-Belman lemma (see the proof of Lemma 6 in Kessler, 1997). 

□ 

Lemma 7.2. Let Xi = Xt- — Xt^_^ — A„6(aoj Xt^). Under the assumptions Al-A7,for kj ~ 1, 2, d 
and j ~ 1,2,3,4, we have that 

EeAX-'ieti} ^ RiOo,AlX,^,) 

Eo^iX-'X-'lG-^,} ^ A„S^f (/3o) +i?(0o, A2,X,_0, 
Eoo{X X X \g"_j^} = R{9o,Af^,X,^i), 

+ R{eo,AlX,^i) 

Proof. Crucial for the proof is the following result. If / e C^'^'+^\ by applying Ito's formula repeatedly, 
we can prove that 

Ee{f iX,m-i} = J2 ^47(X.-i) A^+\X,_i) (7.1) 

k=0 

where 

d 1 ^ 

i—l — l 

represents the infinitesimal generator for Xt,t G [0, T] and Cg is the fcth interate of Ce- Furthermore Cg 
is the identity function. 
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We only prove the last equality in the statement of Lemma. The others equalities follow by similar 
arguments and require a less number of calculations. 

Let /(xi, X2, 2:3, X4) = nj=i(^j ~ ^i-i ~ ^nbiLiiao)), from (17. Il l for / = 2, we obtain that 
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si-i }^EeAf{x':\x':\x':\x':')\gt,} 



A2 



+i?(0o,A^,^.-i) (7.2) 



Therefore, we can write that 



■Ceo/(2;i,a;2,a;3,X4) 

= 6,^i(ao)(x2 - - A„6t^i(ao))(x3 - X,'!!, - A„6^i,(ao))(a;4 - X^^ - A„&^,(«o)) 
+ bt,{a^){xi - X,^i - A„6^i(ao))(a:3 - X^, - A,,btM)ixi - X^^ - ^nbtM^)) 
+ b1t,{a^){x^ - Xt^ - A„&^i(ao))(x2 - X^, - A„&,ti (ao))(a;4 - X,'^!, - A„6^i(ao)) 
+ &^i(«o)(xi - Xt, - A„6^i(ao))(x2 - X,^i - A^bt,{ao)){x; - Xf!, - A„6f^i(ao)) 

+ i|E^-^(/3o)(x3 - X^!i - A^ht,{a,)){x, - X,'^, - A„6fli(ao)) 

+ E^^^(/3o)(x2 
+ E^^(/3o)(x2 
+ sh'=-^(/3o)(xi 

+ Eh'^(/3o)(xi 
and thus 

Ar.Clj{xtiX-iXUX-i) = -4A^ n + ^{S-^f (/3o)&-^i(«o)&tli(ao) 

+ Eh''(/3o)6^:(ao)6^1iK) + sh''(/3o)6fiiK)&^iiK) 
+ Eh''(/3o)6^i(ao)6^iK)} 
= i?(0o, A3 , + i?(0o, A^ , (7.3) 

Similar and cumbersome calculations lead to the following equality 

An /'2 j^/^'^i -rF^a -rFfe4 ^ 



- ^l-l 




i{ao)){xA 




-A„6fli(ao)) 






l{oio)){xz 




~A„6fii(ao)) 






i{ao)){xA 




-A„6fli(ao)) 




- A ft*"! 


i("o))(a;3 




-A„6(=li(ao)) 




— A ft'^i 


i("o))(a;2 
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A2 

n 

2 



4 



A2(E^_if (/3o)S,^f (/3o) + S^^f (/3o)S-lf (/3o) + (/?o)sh'M/3o)) 



(7.4) 



Therefore, by taking into account the equalities ( 17.3b and ( 17.4b the expansion ( 17.2b leads to the result 
present in the statement of Lemma. 

□ 

Lemma 7.3. Let / : M'* x G — > M /je iMc/z that f is dijferentiable with respect to x and 9, with derivatives 
of polynomial growth in x uniformly in 9. Under the assumptions AI-A7, we have that 



-Y,fiX^-l,9) A" / /(x,0)/ieo(dx) 
" i=l •' 



uniformly in 9. 

Proof. See the proof of Lemma 8 in Kessler (1997). 



□ 



Lemma 7.4. IfUi, i = 1, 2, n are random variables Qi-measurable, then the two following conditions 
imply J2"=iUi ^ U: 

n 

Y,E{u,\g,^i}^u 

n 



Proof. See Lemma 9 in Genon-Catalot and Jacod (1993). 



□ 
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Table 1 : Empirical power function Ph, for different sample sizes n and local alternatives h. The empirical 
power and theoretical power is a = 0.05. Data generating model: the 1 -dimensional Ornstein-Uhlenbeck 
process. 
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Table 2; Empirical power function Ph, for different sample sizes n and local alternatives h. The empirical 
power and theoretical power is a = 0.05. Data generating model: the 1 -dimensional geometric Brownian 
Motion process. 
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Table 3: Empirical power function /Sh, for different sample sizes n and local alternatives h. The empirical 
power and theoretical power is a = 0.05. Data generating model: the 1 -dimensional CIR process. 
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0.592 
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1.000 
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Table 4; Empirical power function Ph, for different sample sizes n and local alternatives h. The empirical 
power and theoretical power is a = 0.05. Data generating model: the 2-dimensional Ornstein-Uhlenbeck 
process. 
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